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1.  Introduction.  The  bio  assay  problem  of  estimating  the  median  effec- 
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tive  dose  of  a preparation  is  probably  as  old  as  any  of  the  important  sta- 
tistical problems. 

With  a few  exceptions  most  of  the  statistical  methods  used  are  derived 
from  fixed  sample  regression  techniques.  There  are,  however,  many  situations 
in  which  a sequential  method  would  seem  appropriate.  Furthermore,  the  tra- 
ditional bio-assay  techniques  rely  heavily  upon  the  assumed  parametric  model 
and  the  threshold  model  for  the  dose-response  curve  (see  Sec.  2). 

Stochastic  approximation  methods  of  the  Robbins-Monro  type  seem  to  be 
attractive  candidates  for  replacement  of  traditional  methods  when  a sequen- 
tial  design  is  feasible,  (e.g.  there  is  a short  time  delay  between  appli- 
cation of  treatment  and  the  response). 

Small  sample  simulations  which  have  been  carried  out  in  the  past  (Wethcrill 
(1963)  and  Cochran  and  Davis  (1965))  have  demonstrated  a very  good  perform- 
ance of  the  Robbins-Monro  method  versus  the  traditional  methods. 

In  this  paper  it  is  shown  that  the  Robbins-Monro  procedure  for  esti- 
mating the  Median  Effective  Dose  (ED,_0)  asy®Ptotically  optimal  in  ihe 
sense  that  it  yields  estimators  with  minimal  asymptotic  variance.  Reduction 
of  the  asymptotic  variance  to  the  minimum  in  the  usual  Robbins-Monro  process 
can  be  done  only  when  the  slope  of  the  dose-response  curve  at  the  estimated 
value  is  known  which  implies  among  other  things  a full  knowledge  of  the 


J 


dose-response  curve.  This  is  clearly  an  unrealistic  situation  in  many  cases. 
A few  authors  constructed  adaptive  methods  which  achieve  the  minimal  asymp- 
totic variance  in  a purely  non-parametric  setting.  -The  application  of  two 
such  techniques  due  to  Venter  (1967)  and  Anbar  (1976)  is  also  discussed. 

The  results  of  some  small  sample  simulations  are  given  in  Section  5. 

In  Section  2 a more  general  model  for  the  bio  assay  problem  is  developed. 
It  is  demonstrated  that  in  general  any  particular  assumption  about  the 
specific  form  of  the  dose-response  curve  is  just  as  arbitrary  as  any  other 
such  assumption.  In  general  one  would  not  want  to  assume  much  more  than  that 
the  curve  possesses  the  properties  of  a distribution  function  and  that  the 
ED^0  is  well  defined. 

In  Section  3 the  Robbins-Monro  procedure  for  estimating  the  ED^q  is 
discussed  and  its  optimality  property  is  proved.  Venter's  and  Anbar' s 
adaptive  versions  are  also  discussed  in  Section  4.  Section  5 is  dedicated 
to  some  numerical  studies. 

2.  A model  for  the  bio-assay  problem.  The  bio-assay  problem  is  centered 
around  the  question  of  obtaining  efficient  techniques  for  estimating  the 
median  effective  dose  (ED^q)  for  a new  preparation. 

The  EL'^q  is  defined  relative  to  a population  of  experimental  animals  and 
the  following  model  is  assumed: 

(i)  To  every  animal  in  the  population  corresponds  a fixed  dose  x, 
so  called  "just  effective  dose",  such  that  any  dose  smaller  than 
x would  produce  no  reaction  when  administered  to  the  animal  and 
any  dose  larger  than  x would  produce  a full  reaction. 

(ii)  The  "just  effective  doses"  are  distributed  in  the  animal  popu- 
lation according  to  the  distribution  F(^i-)  where  F is  a 

o 

known  distribution  function  and  u and  o > 0 are  unknown 
location  and  scale  parameters. 
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Clearly,  when  an  animal  is  selected  at  random  from  the  pooulation  and  a dose 
x is  administered  to  it,  the  probability  that  it  would  react  to  this  dose 
is  If  we  assume  that  F is  symmetric  around  zero,  then  y,  is  the 

G 

median  of  the  "just  effective  doses"  in  the  population.  F is  called  the 
"dose-response  curve"  or  the  "tolerance  curve"  and  y is  the  ED^q. 

Thus  the  bio-assay  problem  is  reduced  to  the  problem  of  estimating  y. 

The  different  models  and  techniques  used,  result  from  different  choices  of 
F which  is  typically  selected  to  be  either  the  normal  or  the  logistic  dis- 
tributions. \ 

The  threshold  theory  which  has  been  just  described  seems  quite  artifi- 
cial. The  resistance  of  an  individual  animal  to  a given  drug  depends  upon 
a multitude  of  variables  such  as  diet,  environmental  conditions  and  various 
physiological  processes  which  cannot  possibly  be  all  controlled.  Thus  one 
may  consider  the  following  general  model.  Let  U be  some  abstract  space. 
Think  of  a point  uQ  in  U as  a complete  description  of  all  relevant  vari- 
ables. Let  o(u,x)  be  the  probability  of  obtaining  a reaction  given  the 
conditions  u € U and  that  dose  x was  administered.  Assume  that  d satis- 
fies the  following  two  conditions: 

(i)  For  every  fixed  u <=  U , e(u,x)  is  monotone  non-decreasing  in 

x. 


(ii)  lira  o(u,x)  = 1 , lim  o(u,x)  = 0 for  every  u € U. 

X-WO  x-*-«*> 

Let  A be  a probability  measure  on  U.  The  function 

(l)  M(x)  = j*  o(u,x)  d'A(u) 

U 

is  the  probability  that  a randomly  selected  animal  would  react  to  the  dose  x. 
Clearly,  M(x)  is  monotone  non-decreasing  with  lim  M(x)  = 0 and 

X-»-«o 

lim  M(x)  = 1.  The  function  M will  be  called  the  tolerance  curve.  If  the 

x-*» 


equation  M(x)  = l/2  has  a unique  solution  at  x = q , q is  the  EDrQ. 

A SPECIAL  CASE.  Let  a(u)  , u £ U be  a real  valued  function  on  U. 
Assume  that  o(u,x)  depends  upon  u only  through  a(u)  and  that 


(2) 


q(u,x)  = *(x  - a(u))  where  % is  a monotone 


non-decreasing  function  of  a real  variable  satisfying  0 < *(t)  < 1 and 

^(t)  = 1 - *(-t)  for  all  t.  Furthermore  assume  that  the  distribution  of 

a(u)  induced  by  the  measure  is  given  by  F(^i-)  where  F is  symmetric 

o 

about  zero  and  ti  and  a > 0 are  location  and  scale  parameters.  In  this 
case 

OB  OB 

M(u  + x)  = J*  t(u  + x - a)  dF(^)  = j*  *(x-t<j)dF(t) 

.OB  -OB 

00  00 

= 1 - J*  *(to-x)  dF(t)  = 1 - j*  ^(-to-x)dF(t) 

-OB  -00 

= 1 - J*  *(u-x-a)dF(^±-)  = 1 - M(u-x)  . 

-00  ° 

M is  obviously  monotone  non-decreasing.  Furthermore,  M(u)  = l/2  and  if 
M has  a positive  derivative  at  x = u , then  u is  the  unique  solution 
of  the  equation  M(x)  = l/2.  The  threshold  theory  is  obtained  when  a(u) 
is  the  "just  effective  dose"  and  * is  chosen  to  be  the  function 


(3) 


*(t)  = / 


if  t < 0 


if  t > 0 


It  should  be  noted  that  the  tolerance  curve  M(t)  is  identical  with 
F(— — ) if  and  only  if  !>  is  of  the  simple  form  (3). 

G 

If  f has  a continuous  derivative  at  some  neighborhood  of  u then  one 
can  easily  check  that 
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(4) 


M'(u)  = - — T *(t)  f'(t/<j)dt  , 
a -“ 


where  M' (u)  = M(  t ) j 


t=u 


When  t is  given  by  (3),  (4)  simplifies  to 


(5) 


M'  (u) 


f(o) 


Remark.  If  one  is  willing  to  accept  that  the  model  introduced  in  this  sec- 
tion is  more  realistic  than  the  threshold  model,  then  even  if  one  knows  the 
particular  form  of  in  (l)  (which  will  be  probably  very  rare  in  practice), 
the  particular  form  of  the  dose-response  curve  M will  still  be  unknown. 

Thus  one  should  be  very  weary  of  using  methods  which  depend  heavily  upon 
the  mathematical  form  of  M without  studying  their  robustness  against 
deviations  from  it. 

3.  Stochastic  apuroxi mat ions.  Stochastic  approximations  of  the 
Robbins-Monro  type  are  sequential  procedures  which  are  designed  to  estimate 
the  zero  of  a regression  function  when  the  investigator  can  only  assume 
some  general  properties  of  the  function  but  not  its  mathematical  from. 

The  problem  of  estimating  the  ED,. ^ (or  any  other  percentile  of  the  dose- 
response  curve)  falls  under  this  category  of  problem  since  the  ED,.q  is 
the  solution  of  the  equation  M(x)  - l/2  = 0. 

The  Robbins-Monro  procedure  is  the  following.  Let  H(x)  be  a real 
valued  function  of  a real  variable  x.  Let  q be  the  unique  solution  of 
the  equation  H(x)  = 0.  Let  fY  1 be  an  observable  family  of  random  vari- 
ables such  that  EY^  = H(x)  for  every  x.  Let  f a 1 be  n sequence  of 

00  00  2 

non-negative  real  numbers  such  that  S a = ® and  T,  a < ». 

i n , n 

n=l  n=l 
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Let  X^  be  a random  variable  and  define 

(6)  Xn+1  = Xn  - Yn  n 1,2,... 

where  Y is  a random  variable  with  conditional  distribution  given 
n 

(X.  **,>...»  X = x ) the  same  as  the  distribution  of  Y . 
li  n n x 

n 

The  process  (6)  is  the  Robbins-Monro  process.  Under  some  regularity 

conditions  Xr  -»  9 with  probability  one.  (See  e.g.  Blum  (1954)).  For 

a = a'1  , n1//2(X  -0)  converges  in  law  to  a normal  random  variable  with 

n n ’ n 

2 2 

mean  zero  and  variance  a A /(2Aa-l)  where  oc  > 0 is  the  derivative  of 

P p 

H at  x = 9 , a = lim  E(Y  - H(x))  and  A is  chosen  such  that  2Aa  > 1 

x-*e  x 

(Sacks  (1958)). 

The  problem  of  efficiency  of  the  Robbins-Monro  process  as  measured  by 
the  asymptotic  variance  was  studied  by  several,  authors.  One  can  easily 
verify  that  the  choice  A = of ^ minimizes  the  asymptotic  variance.  When 
a is  unknown  an  adaptive  process  can  be  constructed.  One  adaptive  process 
was  suggested  by  Venter  (1967),  Another  process  was  studied  by  Anbar  (1976) 
The  two  processes  are  optimal  in  the  sense  that  their  asymptotic  variance  is 
minimal.  However,  while  Venter's  procedure  requires  taking  two  observations 
at  each  approximation,  in  Anbar’ s method  the  observations  are  taken  one  at 
a time.  A different  way  of  reducing  the  asymptotic  variance  was  studied  by 
Anbar  (1973)  and  Abdelhamid  (1973).  They  have  shown  that  when  Yx  is  dis- 
tributed according  to  F(y-H(x))  where  F is  a known  distribution,  one 
can  minimize  the  asymptotic  variance  by  applying  a suitable  transformation 
to  the  Yn's.  When  0£  is  known,  the  optimal  transformation  yields  a 
most  efficient  estimator  in  the  sense  that  the  asymptotic  variance  equals 
to  the  Cran£r-Rao  lower  bound  for  estimating  the  zero  when  the  regression 
function  is  linear.  Fabian  (1973)  have  constructed  a process  which  achieves 
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the  Cramlr-Rao  lower  bound  also  when  a is  not  known. 

The  results  of  Abdelhamid,  Anbar  and  Fabian  cannot  be  applied  to  the 
bio-assay  situation  because  the  family  of  distributions  of  the  Yx's  is 
not  a shift  family  generated  by  a fixed  distribution.  However,  as  it  is 
shown  in  this  paper,  in  the  bio-assay  case  the  Robbins-Monro  process  yields 
a most  efficient  procedure  when  a is  known  and  hence  both  Venter's  and 
Anbar' s modifications  can  be  used  to  produce  a most  efficient  sequence  of 
estimators  when  cc  is  unknown. 

To  apply  the  Robbins-Monro  procedure  to  the  bio-assay  problem  assume 
that  when  a dose  x is  applied  to  N subjects  the  investigator  observes 
the  random  variable  Nx  - the  number  of  subjects  reacting  to  the  dose  x. 

If  the  tolerance  curve  is  M(x)  then  Nx  is  a binomial  variable  with 
parameters  (N,M(x)).  Let  be  the  initial  dose  chosen  by  the  investigator. 

For  n = 1,2,...  define 

(7)  Vi  ’ x„  - to'X-Va) 

where  P = N /N  . 
n x ' 
n 

The  relation  (7)  determines  sequentially  a sequence  of  doses  to  be 
applied. 


tot ic ally  optima 


Assume  that  the  tolerance  curve 


M(x)  is  monotone  non-decreasing  talcing  on  values  in  [0,1]  and  that  M' (x) 
exists  and  continuous  in  some  neighborhood  of  u.  and  M'(u)  > 0.  Denote 
CL  = M'  (n) . It  is  easy  to  verify  that  in  this  case  the  conditions  for  con- 
vergence and  asymptotic  normality  of  the  process  (7)  are  satisfied  and  thus 


Xn  -»  a with  probability  one  and  if  2QA  > 1 then  /n[Xn-u ) converges  to  a 

2 

normal  random  variable  with  mean  zero  and  variance  A /4n(23A-1). 
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F i 


Denote  P = N /N.  Let  g be  a function  and  define  M (x)  = Eg(P  ). 

x x 8 x 


Assume  that 


a)  M (x)  is  a tolerance  curve. 

b)  M (u)  = 1/2  . 

© 

c)  M (x)  is  continuously  differentiable  at  x = ^ with  M' (u)  > 0. 

8 8 


f i jj| 

I 

t 


One  can  define  another  approximation  process  by  replacing  Pn  in 

(7)  by  g(P  ).  If  in  addition  one  substitutes  in  (7)  A = fM'  (u ) 3 ~ then 
n g 

(r) 

-»  u.  with  probability  one  and 

yr(x<g) . „)  i k(o,0|) 


where 


at  = lin  E[g(P J - M (x)  ]2/  [M*  (p, ) ]2 


■ Var[g(P  )]/[M'(u)r  . 

u g 


M (x)  = E g(j/N)(^)  ~ Q!?(x)(l  - M(x))N"J]  . 

g j=0  J 031 


Simple  calculations  yield 


Mg(x)  " mCxIIi-mIxTI  * COV^(Px^  PX) 


= M (x)  cov(g(Px),Px)/var(Px)  . 


By  the  Cauchy-Schwartz  inequality  one  readily  obtains 


Var(g(P  ))  Var  P 

A ^ A 

(Mg(x) )2  “ (m'(x))2 


for  all  x. 


The  right  hand  side  of  (10)  at  x = y,  is  the  asymptotic  variance  of  the 
Robbins-Monro  process  (7).  Since  equality  in  (10)  is  achieved  if  and  only 
if  g(x)  = cx  for  some  constant  c one  obtains  that  the  Robbins-Monro 


procedure  is  indeed  optimal  for  this  problem.  Note  that  at  x = y,  the 
right  hand  side  of  (10)  equals  to  [Un&2]-^  which  is  the  Cramer-Rao  lower 


bound  for  estimating  y,  when  the  tolerance  curve  is  linear.  Thus  the 


process  (7)  is  most  efficient  when  A = of  \ 


When  a is  unknown  most  efficient  processes  exist.  They  are  adaptive 


modification  of  (7).  Let  us  describe  two  such  procedures. 


Venter's  procedure.  The  first  approximation  (dose)  is  arbitrary.  For 

n > 1 the  (n+l)st  approximation  is  determined  by  the  recursive  relation. 


1 i » " 

X ..  = X - d A ±(Y  + Y ) 
n+1  n n n 2'  n n 


where  Y and  Y are  observations  at  X + c and  X - c respectively 
nn  nnnn^^ 


(dn)  and  (c^  are  sequences  of  positive  numbers  satisfying 

dn  = n_1(l  + oCn”'1/2))  and  dR  = cn_Y  (c  > 0 , 0 < y < l/2)  and  An  is 


an  estimator  for  0!  determined  as  follows. 


Denote  the  truncation  of  a function  f in  the  interval  [a,b]  by 


[f]  . That  is 

€L 


[f(x)]a=b  if  f(x)  > b 


= f (x)  if  a < f (x)  < b 


= a if  f(x)  < a . 





where  a and  b are  two  known  numbers  satisfying 


a < a < b.  Venter  has  shown  the  following. 


2.  For  l/b  < y < l/2  , n ' (Xn  - p,)  converges  in  distribution  to  a 

2 2 

normal  random  variable  with  mean  zero  and  variance  a /2a  . For 
Y = 1/4  the  asymptotic  distribution  of  n1^2(Xn~  p,)  is  normal 
with  mean  - aot^c^/d!  and  variance  <j2/ 2a2  , where  a^  is  the 
second  derivative  of  M at  x = u . 

Venter  also  investigated  the  rate  of  convergence  of  A to  a.  As  it  turns 

n 

out  the  rate  of  convergence  of  An  depends  upon  the  choice  of  y &ncl  c. 
The  smaller  y is  and  the  larger  c is  the  better  is  the  rate  of  conver- 
gences. However,  when  y < l/b  a bias  is  introduced  in  the  estimation  of 
H which  increases  with  c.  Thus  Venter  recommended  to  use  y = l/4 
and  a moderate  value  of  c.  In  the  next  section  the  performance  of  Venter’s 
procedure  is  examined  for  various  values  of  c via  computer  simulations. 

Anbar's  procedure.  The  first  approximation  X^  is  chosen  arbitrarily.  For 
n > 1 the  (n+l)st  approximation  is  determined  by  the  recursive  relation 

(12)  X = X - A-1  n"1  Y 

n+1  n n-1  n 

where  Y is  an  observation  at  X and  A = [B  ]b  where  B is  given  by 
n n n n L n a n * 

E (X.  - X ) Y 

B - a-* D 

n n 0 

e (x.  - x r 

. ' i n 

i=X 


for  n > 2 and  BQ  and  B1  are  arbitrary  positive  numbers.  The  numbers 
a and  b are  assumed  to  be  known  and  a < Ot  < b.  Anbar  (1976)  has  shown 
that  An  -*  a a.s.  and  n1//2(Xn  - p.)  converges  in  distribution  to  a normal 
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random  variable  with  mean  zero  and  variance 


Both  Venter  and  Anbar  have  suggested  to  compute  by  summing  for 

some  integer  m = m(n)  to  n instead  of  from  1 to  n in  order  to  disregard 
large  deviations  which  may  occur  in  the  first  approximation. 


5.  Numerical  results.  In  this  section  some  numerical  studies  are 
summarized.  For  reference  purposes  the  numerical  simulations  were  carried 
out  along  the  same  lines  as  the  studies  of  Cochran  and  Davis  (1965).  The 
dose-response  curve  was  chosen  to  be  the  standard  normal  cumulative  curve. 
Venter’s  and  Anbar' s procedures  were  simulated  for  different  total  number 
of  experiments  available  (n  = 12,  24)  and  different  number  of  experiments 
performed  at  each  stage  (m  = 1,2,3).  The  mean  square  error  and  the  bias 
were  estimated  on  the  basis  of  100  independent  repetitions  of  each  simula- 
tion. We  have  followed  Venter's  suggestion  and  chose  y = l/4.  Tables  1 
through  6 give  the  MSE  and  bias  for  both  Anbar’ s and  Venter's  methods  as 
functions  of  the  starting  point  x^  for  the  different  values  of  n and  m. 

Venter's  method  was  simulated  for  different  values  of  c.  The  slope  at 
-l/2 

x = n is  cc  = (2jt)-  ' = .39S9.  The  truncation  points  for  the  estimating 
the  slope  in  both  methods  were  chosen  to  be  a = .5 0J  = .1995  and 
b = 1.5a  = .5924. 

An  examination  of  the  simulations  results  reveals  some  interesting 
phenomenae.  The  most  striking  phenomenon  is  the  higher  dependability  of 
Venter's  method  on  the  value  of  the  initial  dose  X^.  Both  MSE  and  bias  tend 
to  increase  fairly  rapidly  with  X^  . This  is  generally  true  for  all  values 
of  c.  However,  the  sensitivity  of  Venter's  Method  to  error  in  the  initial 
dose  increases  significantly  with  c.  On  the  other  hand  if  the  initial  dose 
is  close  to  the  ED_rt  a high  value  of  c produces  very  small  MSE  and  bias. 
As  compared  to  Venter's  method  Anbar' s method  tends  to  produce  MSE's 


.2- 


comparable  to  or  higher  than  Venter's  when  the  error  in  the  initial  dose  is 
small.  For  large  error  in  the  initial  dose  Anbar's  method  yields  MSE's  and 
biases  significantly  smaller  than  Venter's  method.  It  seems  that  Anbar's 
method  5s  fairly  insensitive  to  errors  in  the  initial  dose.  Comparing 
with  Cochran  and  Davis  results,  the  MSE  involved  in  using  Anbar's  method 
are  very  similar  to  those  obtained  by  using  the  optimal  Robbins-Monro  pro- 
cedure when  the  slope  of  the  dose-response  curve  at  the  ED_n  is  known. 
Another  important  observation  is  that  the  bias  component  in  the  MSE  is 
typically  much  larger  in  Venter's  method  than  in  Anbar's.  This  is  to 
expected  for  large  values  of  n since  for  y = l/4  Venter's  procedure  is 
asymptotically  biased.  It  seems  that  this  fact  shows  up  also  for  very  small 
values  of  n.  The  values  of  the  slope  estimates  are  not  reported  here. 

They  were  computed  in  the  study.  Venter's  procedure  almost  invariably 
over  estimated  the  slope.  In  fact  almost  all  values  were  very  close  to  the 
upper  truncating  point.  On  the  other  hand,  Anbar's  method  produced  reason- 
able estimates. 

To  summarize,  if  one  has  a very  good  apriory  knowledge  about  the 
location  of  the  El)  ^ , Venter's  method  will  produce  very  efficient  esti- 
mators.  If  on  the  other  hand  one  does  not  have  a very  reliable  prior  know- 
ledge. the  use  of  Anbar's  method  involves  a very  little  risk. 
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